Automatic formatted transcripts for videos
Authors
Abstract
Multimedia content may be supplemented with time-aligned closed captions for accessibility. Often these captions are created manually by professional editors, an expensive and time-consuming process. In this paper, we present a novel approach to the automatic creation of a well-formatted, readable transcript for a video from closed captions or ASR output. Our approach uses acoustic and lexical features extracted from the video and from the raw transcription/caption files. We compare our approach with two standard baselines: a) silence-segmented transcripts and b) text-only segmented transcripts. We show that our approach outperforms both baselines on subjective and objective metrics.
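As a rough illustration of the first baseline named above, silence-segmented transcripts, the following minimal Python sketch starts a new paragraph whenever the pause between two consecutive captions reaches a threshold. The `Caption` type, the `segment_by_silence` function, and the 1.5-second `min_gap` default are assumptions made for this example; the abstract does not specify how the baseline was implemented.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Caption:
    """One time-aligned caption cue (hypothetical structure for this sketch)."""
    start: float  # start time in seconds
    end: float    # end time in seconds
    text: str


def segment_by_silence(captions: List[Caption], min_gap: float = 1.5) -> List[str]:
    """Group captions into paragraphs, breaking where the pause between
    one caption's end and the next caption's start is at least min_gap seconds.
    The threshold value is an assumption, not taken from the paper."""
    paragraphs: List[str] = []
    current: List[str] = []
    prev_end = 0.0
    for cap in captions:
        # A long enough silence closes the current paragraph.
        if current and cap.start - prev_end >= min_gap:
            paragraphs.append(" ".join(current))
            current = []
        current.append(cap.text.strip())
        prev_end = cap.end
    if current:
        paragraphs.append(" ".join(current))
    return paragraphs


example = [
    Caption(0.0, 2.0, "Hello everyone."),
    Caption(2.1, 4.0, "Welcome back."),
    Caption(6.5, 8.0, "Today we cover parsing."),
]
paragraphs = segment_by_silence(example)
```

Here the 2.5-second pause before the third caption triggers a paragraph break, while the 0.1-second pause between the first two does not. A pause-only rule like this ignores lexical cues, which is the kind of limitation the abstract's combined acoustic and lexical approach is meant to address.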
Similar resources
International Journal of advanced studies in Computer Science and Engineering
Many organizations and universities provide distance learning by recording classroom lectures and making them available to students over the Internet. A repository generally contains hundreds of such lecture videos. Each lecture video typically runs for more than an hour, and the file is often very large. It is cumbersome for students to search through an entire video, or across many videos, in order...
Mismatch interpretation by semantics-driven alignment
This paper describes a method for the alignment of automatically recognized speech transcripts with formatted documents manually derived from the speech recognition results. Novel features of our alignment method are a parametrizable scoring function, an intelligent tokenization system drawing on domain knowledge, and semantic comparisons. The field of application is dictated medical reports p...
Digital Watermarking Technology in Different Domains
Due to high-speed computer networks, the use of digitally formatted data has increased manyfold. Digital data can be duplicated and edited with great ease, which has led to a need for effective copyright protection tools. Digital watermarking is a technology for embedding a watermark carrying intellectual property rights into images, videos, audio, and other multimedia data by a certain algorithm. Di...
TUD-MIR at MediaEval 2011 Genre Tagging Task: Query expansion from a limited number of labeled videos
In this paper we present results of our initial research on genre tagging. We approach the task from an information retrieval perspective, using a relatively small number of labeled videos in the development set to mine query expansion terms characteristic of each genre. We also investigate which sources of information associated with the videos or extracted from their audio channel, e.g. title, de...
Video Indexing and Automatic Caption Creation
This paper presents the design and implementation of a video indexing and automatic caption creation system. The system is able to extract audio from videos and to obtain the transcript directly from the audio file using a newly designed audio-to-text engine based on a Hidden Markov Model (HMM). Transcripts can be edited, and the corresponding time stamps are updated automatically. The video indexi...